CDS

Accession Number TCMCG075C00780
gbkey CDS
Protein Id XP_017971399.1
Location join(3356687..3356844,3356934..3357024,3357126..3357280,3357486..3357588,3357824..3357919,3358169..3358234,3358347..3358466,3358566..3358649)
Gene LOC18611250
GeneID 18611250
Organism Theobroma cacao
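
The Location field above lists eight exon segments in GenBank-style `join(start..end,...)` notation. As a rough consistency check, the sketch below sums the segment lengths and compares the result with the protein length given in this record. It assumes coordinates are 1-based and inclusive and that the final codon is a stop codon. The helper names `parse_join` and `spliced_length` are hypothetical and belong to no particular library.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "join(3356687..3356844,3356934..3357024,3357126..3357280,"
    "3357486..3357588,3357824..3357919,3358169..3358234,"
    "3358347..3358466,3358566..3358649)"
)


def parse_join(location):
    """Return (start, end) pairs from a join(...) string (hypothetical helper)."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total nucleotides across segments, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_nt = spliced_length(segments)      # 873 nucleotides
codons = cds_nt // 3                   # 291 codons, including the stop codon
residues = codons - 1                  # 290, matching "Length 290aa"
print(len(segments), cds_nt, codons, residues)
```

Under these assumptions the eight segments total 873 nt, or 291 codons, which agrees with the 290 aa protein plus one stop codon.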

Protein

Length 290aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018115910.1
Definition PREDICTED: probable prolyl 4-hydroxylase 9 isoform X4 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category E
Description prolyl 4-hydroxylase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01252
KEGG_rclass RC00478
BRITE ko00000
ko00001
ko01000
KEGG_ko ko:K00472
EC 1.14.11.2
KEGG_Pathway ko00330
ko01100
map00330
map01100
GOs GO:0007275
GO:0008150
GO:0009653
GO:0009888
GO:0009987
GO:0010015
GO:0010053
GO:0010054
GO:0021700
GO:0022622
GO:0030154
GO:0032501
GO:0032502
GO:0048364
GO:0048468
GO:0048469
GO:0048731
GO:0048764
GO:0048765
GO:0048856
GO:0048869
GO:0071695
GO:0080147
GO:0090558
GO:0090627
GO:0099402
GO:1905392

Sequence

CDS:  
ATGAAATGTTCGACGATTATACACCAAAAAAGAAGGCCAAAAGCAGAGCCTATTCCTTGTGCTGGAACTCCAAAGCAAACATCGGGTTTCCTGGCGTTTTTCTCTTCTGTTGTTTATTCTTCCTTGCTGGTTTCTTTGCTTCCAACCTTCTTTCTCAGGCAGAGAGGGAGGAGGCTCGAGTCACTGGACTATGACTTGATGGCACATGGAGAAACCGGAGATGATTCTGTTTCTGTCATTCCTTTTCAGGTTATAAGCTGGGGGCCGCGTGCCTTCTATTTTCCCAACTTTGCAACTCCAGCGCAATGCCAACACATAATTGACATGGCAAAACCAAAACTTGAACCATCAACGGTGCTTTTAGCAAAGGGAGAAACCCAGCAGCCAAATGATGTTAGAACAAGTATGGGTACATTTCTCAGTGCTTATGAAGATGAGACTGGGGTTTTGGATGACATTGAGGAAAAGATTGCAAAGGCAACGAAGCTACCAAGAGTTAACTACGAGGCATTCAATGTCTTGCGCTATGGAGTAGGACAGAAATATGATTCGCATTATGATGTGTTTGATCCTGAGCGGTATGGCCCTCAAAAGAGCCAAAGGGTTGCAACCTTCTTGCTGTACTTATCAGATGTTGAAGGAGGAGGGGAAACCGCATTTCCATTTGAGGATGGCTTGAATATGGATGAAAATTATGATGTCAAAAAATGTATTGGCCTGAAAGCAAAGCCTAGCCTAGGAGATGGACTTCTATTTTATTCATTGTTCCCCAATAATTCGATCGATCCAACATCAACTCATGGGAGCTGTCCAGTAATCAAAGGGGCAAAATGGGTGGCTACAAAGTGGATCAGAGATCAGCAAGACTTTTAG
Protein:  
MKCSTIIHQKRRPKAEPIPCAGTPKQTSGFLAFFSSVVYSSLLVSLLPTFFLRQRGRRLESLDYDLMAHGETGDDSVSVIPFQVISWGPRAFYFPNFATPAQCQHIIDMAKPKLEPSTVLLAKGETQQPNDVRTSMGTFLSAYEDETGVLDDIEEKIAKATKLPRVNYEAFNVLRYGVGQKYDSHYDVFDPERYGPQKSQRVATFLLYLSDVEGGGETAFPFEDGLNMDENYDVKKCIGLKAKPSLGDGLLFYSLFPNNSIDPTSTHGSCPVIKGAKWVATKWIRDQQDF
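
The relationship between the CDS and protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and the function name `translate` is hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption: NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a nucleotide string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":          # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS shown above.
cds_start = "ATGAAATGTTCGACGATTATACACCAAAAA"
cds_end = "CAGCAAGACTTTTAG"

print(translate(cds_start))  # MKCSTIIHQK, the first ten residues of the protein
print(translate(cds_end))    # QQDF, the last four residues; TAG is the stop codon
```

Passing the full CDS string to `translate` should, under the same assumption, reproduce the complete protein sequence listed above.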